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-Abstract- 

We prove that the Biichi topology, the automatic topology, the alphabetic topology and the 
strong alphabetic topology are Polish, and provide consequences of this. 
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1 Introduction 

This paper is a contribution to the study of the interactions between descriptive set theory 
and theoretical computer science. These interactions have already been the subject of many 
studies, see for instance [18, 32, 21, 30, 27, 25, 29, 6, 28, 7, 11, 9, 5]. 

In particular, the theory of automata reading infinite words, which is closely related to 
infinite games, is now a rich theory which is used for the specification and verification of 
non-terminating systems, see [12, 25]. The space £ N of infinite words over a finite alphabet 
£ being equipped with the usual Cantor topology, a natural way to study the complexity of 
w-languages accepted by various kinds of automata is to study their topological complexity, 
and particularly to locate them with regard to the Borel and the projective hierarchies. 

However, as noticed in [26] by Schwarz and Staiger and in [15] by Hoffmann and Staiger, 
it turned out that for several purposes some other topologies on a space £ N are useful, for 
instance for studying fragments of first-order logic over infinite words or for a topological 
characterisation of random infinite words (see also [14]). In particular, Schwarz and Staiger 
studied four topologies on the space £ N of infinite words over a finite alphabet £ which 
are all related to automata, and refine the Cantor topology on £ N : the Biichi topology, the 
automatic topology, the alphabetic topology, and the strong alphabetic topology. 

Recall that a topological space is Polish iff it is separable, i.e. contains a countable dense 
subset, and its topology is induced by a complete metric. Classical descriptive set theory is 
about the topological complexity of definable subsets of Polish topological spaces, as well 
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as the study of hierarchies of complexity (see [16, 25] for basic notions). The analytic sets, 
which are the projections of Borel sets, are of particular importance. Similar hierarchies of 
complexity are studied in effective descriptive set theory, which is based on the theory of 
recursive functions (see [24] for basic notions). The effective analytic subsets of the Cantor 
space (2 n ,tc) are highly related to theoretical computer science, in the sense that they 
coincide with the sets recognized by some special kind of Turing machines (see [31]). 

In [26], Schwarz and Staiger prove that the Biichi topology, which is generated by the 
regular w-languages, is metrizable. It is separable, by definition, because there are only 
countably many regular cc-languages. It remains to see that it is completely metrizable to 
see that it is Polish. This is one of the main results proved in this paper. 

We now give some more details about the topologies studied by Schwarz and Staiger in 
[26] that we investigate in this paper. Let E be a finite alphabet with at least two symbols. 
We will consider the topology rg on E N generated by the set B 5 of sets accepted by an 
unambiguous Biichi Turing machine. Let E* be the set of finite sequences of elements of E. 
The following topologies on E N , related to automata, are considered in [26]. 
h The Biichi topology tb , generated by the set B^ of w-regular languages, 
h The automatic topology ta , generated by the set B^ of rc-closed w-regular languages 
(this topology is remarkable because all rc-closed w-regular languages are accepted by 
deterministic Biichi automata), 

The alphabetic topology r a , generated by the set B a of sets of the form B w a '■= {wa \ 
a £ A H }, where w £ E* and AC E (this topology is useful for investigations in restricted 
first-order theories for infinite words), 

h The strong alphabetic topology r s , generated by the set B s of sets of the form 

S Wi a '■= {wa | a £ ^4 n A Ma £ A Vfc £ N 3i > k a{i) = a}, 

where w £ E* and A C E (this topology is derived from r a , and considered in [4], together 
with r a ). 

In [26], Schwarz and Staiger prove that these topologies are metrizable. We improve this 
result: 

► Theorem 1. Let z £ {C, <5, B, A, a, s}. Then t z is Polish. 

From this result, it is already possible to infer many properties of the space E N , where E 
is a finite alphabet, equipped with the Biichi topology. In particular, we first get some results 
about the a-algebra generated by the w-regular languages. It is stratified in a hierarchy 
of length wi (the first uncountable ordinal) and there are universal sets at each level of 
this hierarchy. Notice that this er-algebra coincides with the a- algebra of Borel sets for the 
Cantor topology, and that a set is Borel for the Cantor topology if and only if it is Borel for 
the Biichi topology, but the levels of the Borel hierarchy differ for the two topologies. For 
instance an w-regular set which is non-IlS] for the Cantor topology is clopen (i.e., A?) for 
the Biichi topology. Therefore the results about the existence of universal sets at each level 
of the cr-algebra generated by the w-regular languages are really new and interesting. We 
derive many other properties from the polishness of the Biichi topology. 

2 Background 

We first recall the notions required to understand fully the introduction and the sequel (see 
for example [25, 30, 16, 24]). 
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2.1 Theoretical computer science 

A Biichi automaton is a tuple A = (Q , E, <5, Qi , Q /), where E is the input alphabet, Q is the 
finite state set, <5 is the transition relation, Qi and Qf are the sets of initial and final states. 
The transition relation 8 is a subset of Q x E x Q. 

A run on some sequence a £ E N is a sequence (q n )i eN € of states such that go is 
initial (go £ Qi) and (g^, cr(i), gi+i) is a transition in 8 for each i > 0. It is accepting if it 
visits infinitely often final states, that is, qi £ Qf for infinitely many i. An input sequence er 
is accepted if there exists an accepting run on a. The set of accepted inputs is denoted L(A). 
A set of infinite words is called co-regular if it is equal to L{A) for some automaton A. 

A Biichi automaton is actually similar to a classical finite automaton. A finite word w 
of length n is accepted by an automaton A if there is sequence (g t ) ? ;< n of n + 1 states such 
that go is initial (go £ Qi), q n is final (g„ £ Qf) and (g*, cr(i), g i+1 ) is a transition in 8 for 
each 0 < i < n. The set of accepted finite words is denoted by U(A). A set of finite words is 
called regular if it is equal to U(A) for some automaton A. 

Let us recall that the w-power of a set U of finite words is defined by 

U u = {a £ E n | 3 (tOj)ieN € U N s.t. a = w 0 wiW 2 ■ ■ ■}■ 

The w-powers play a crucial role in the characterization of w-regular languages (see [1]): 

► Theorem 2 (Biichi). Let E be a finite alphabet , and L C E N . The following statements 
are equivalent: 

1. L is co-regular, 

2 . there are 2 n regular languages ( Ui)i <n and (V))j<„ such that L = UiVfi. 

The closure properties of the class of w-regular languages mentioned in the introduction 
are the following (see [25] and [26]). If E is a set and w £ E*, then w defines the usual 
basic clopen set N w := {a £ E N | w is a prefix oftr} of the Cantor topology tc (so Be := 
{0} U {N w | w £ E*} is a basis for tc)- 

► Theorem 3 (Biichi). The class of to-regular languages contains the usual basic clopen sets 
and is closed under finite unions and intersections, taking complements, and projections. 

We now turn to the study of Turing machines (see [3, 30]). 

A Biichi Turing machine is a tuple Ai = (T,,Q,T,8,qo,Qf), where E and T are the 
input and tape alphabets satisfying E C T, Q is the finite state set, <5 is the transition 
relation, go is the initial state and Qf is the set of final states. The relation 8 is a subset of 
(QxT)x(QxTx {- 1 , 0 , 1 }). 

A configuration of Ad is a triple (g, 7 ,j) where g £ Q is the current state, 7 £ T N is the 
content of the tape and the non-negative integer j £ N is the position of the head on the 
tape. 

Two configurations (g, 7 , j) and (q',”f',f) of M. are consecutive if there exists a transition 
(g, a, q ', b,d) £ <5 such that the following conditions are met: 

1. 7 (j) = a, Y( j ) = b and 7 (?) = 7 '{i) for each i 7 ^ j. This means that the symbol a is 
the replaced by symbol b at position j and that all other symbols on the tape remain 
unchanged. 

2. the two positions j and j 1 satisfy the equality j' = j + d. 

A run of the machine M on some input 0 £ E N is a sequence (pi, 7 i, jfjien of consecutive 
configurations such that po = qo, 7 o = & and jo = 0. It is complete if the head visits all 
positions of the tape. This means that for each integer N, the exists an integer i such that 
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ji > N. The run is accepting if it visits infinitely often final states, that is, Pi £ Qf for 
infinitely many integers i. The w-language accepted by Xi is the set of inputs a such that 
there exists an accepting and complete run on a. 

Notice that other accepting conditions have been considered for the acceptance of infinite 
words by Turing machines, like the T or Muller ones (the latter one was firstly called 
3-acceptance), see [3, 30]. Moreover several types of required behaviour on the input tape 
have been considered in the literature, see [31, 10, 8]. 

A Biichi automaton A is in fact a Biichi Turing machine whose head only move forwards. 
This means that each of its transition has the form (p, a, q , b , d ) where d = 1. Note that the 
written symbol b does not matter since each position of the tape is just visited once and the 
symbol b is never read. 

2.2 Descriptive set theory 

Classical descriptive set theory takes place in Polish topological spaces. 

We first recall that if d is a distance on a set X , and (x n )„ e N is a sequence of elements of X, 
then the sequence (a; n ) ng N is called a Cauchy sequence if 

VfceN 3N £ N Vp,y > N d(x p ,x p/ ) < 

In a topological space X whose topology is induced by a distance d, the distance d and the 
metric space (X,d) are said to be complete if every Cauchy sequence in X is convergent. 

► Definition 4. A topological space X is a Polish space if it is 

1 . separable (there is a countable dense sequence (x n ) in X), 

2. completely metrizable (there is a complete distance d on X which is compatible with the 
topology of X). 

Effective descriptive set theory is based on the notion of recursive function. A function 
from to is said to be recursive if it is total and computable. By extension, a relation 
is called recursive if its characteristic function is recursive. 

► Definition 5. A recursive presentation of a Polish space X is a pair ((x„) ng N ) d) such that 

1. (a: n )neN is dense in X, 

2. d is a compatible complete distance on X such that the following relations P and Q are 
recursive: 

TYl 

P(i,j,m,k) «=> d{xi,Xj) <-j—j> 

Q(i,j,m,k) d{xi,Xj ) < ——-. 

k+1 

A Polish space X is recursively presented if there is a recursive presentation of it. 

Note that the formula (p, q) K>• 2 p (2q + 1) — 1 defines a recursive bijection N 2 —>• N. One 
can check that the coordinates of the inverse map are also recursive. They will be denoted 
n i ^ (n)o and n <—> (n) i in the sequel. These maps will help us to define some of the basic 
effective classes. 

► Definition 6. Let {(x n ) n ^,d) be a recursive presentation of a Polish space X. 

1. We fix a countable basis of X: B(X , n) is the open ball Bd(x^ 0 , 
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2. A subset S of A is semirecursive , or effectively open (denoted S £ £°) if 

S = U B ( X ’ /(»))> 

new 

for some recursive function /. 

3. A subset S of A is effectively closed (denoted S £ 77°) if its complement ~>S is semire¬ 
cursive. 

4. One can check that a product of two recursively presented Polish spaces has a recursive 
presentation, and that the Baire space N N has a recursive presentation. A subset S of X 
is effectively analytic (denoted S £ S\) if there is a 77° subset C of X x N n such that 

S = tt 0 [C] := {a; G A | 3a G N n (a:, a) £ C}. 

5. A subset S of X is effectively co-analytic (denoted S £ III ) if its complement -<S is 
effectively analytic, and effectively Borel if it is in and III (denoted S £ A\). 

6. We will also use the following relativized classes: if X, Y are recursively presented Polish 
spaces and y G P, then we say that A C X is in £}(y) if there is S £ A) (Y x X) such 
that A = S y := {x £ X \ (y,x) £ S'}. The class 77 l(y) is defined similarly. We also set 
A\{y):=El{y)CMll{y). 

The crucial link between the effective classes and the classical corresponding classes is as 
follows: the class of analytic (resp., co-analytic, Borel) subsets of Y is equal to U a eN N X i( a ) 
(resp., U Q eN N ^i( a )- U a eN N ^i( a ))- This allows to use effective descriptive set theory to 
prove results of classical type. In the sequel, when we consider an effective class in some £ N 
with £ finite, we will always use a fixed recursive presentation associated with the Cantor 
topology. The following result is proved in [31], see also [8]: 

► Theorem 7. Let £ be a finite alphabet, and L C £ N . The following statements are 
equivalent: 

1. L = L{M) for some Biichi Turing machine Xi, 

2. L £ S]. 

We now recall the Choquet game played by two players on a topological space A'. Players 
1 and 2 play alternatively. At each turn i, Player 1 plays by choosing an open subset Ui and 
a point Xi £ Ui such that Ui C V}_ 1 , where has been chosen by Player 2 at the previous 
turn. Player 2, plays by choosing an open subset Vi such that Xi £ V) and V) C Ui. Player 2 
wins the game if D/gn ^ We now recall some classical notions of topology. 

► Definition 8. A topological space A is said to be 
_ Tf if every singleton of A is closed, 

h regular if for every point of A and every open neighborhood U of x, there is an open 
neighborhood V of x with PC U, 
h second countable if its topology has a countable basis, 
h zero-dimensional if there is a basis made of clopen sets, 

h strong Choquet if A is not empty and Player 2 has a winning strategy in the Choquet 
game. 

Note that every zero-dimensional space is regular. The following result is 8.18 in [16]. 

► Theorem 9 (Choquet). A nonempty, second countable topological space is Polish if and 
only if it is T\, regular, and strong Choquet. 
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Let X be a nonempty recursively presented Polish space. The Gandy-Harrington topology 
on X is generated by the E\ subsets of X , and denoted Ex- By Theorem 7, this topology is 
also related to automata and Turing machines. As there are some effectively analytic sets 
whose complement is not analytic, the Gandy-Harrington topology is not metrizable (in fact 
not regular) in general (see [3E.9] in [24]). In particular, it is not Polish. 


3 


Proof of the main result 


The proof of Theorem 1 is organized as follows. We provide below four properties which 
ensure that a given topological space is strong Choquet. Then we use Theorem 9 to prove 
that the considered spaces are indeed Polish. 

Let E be a countable alphabet. The set S N is equipped with the product topology of the 
discrete topology on S, unless another topology is specified. This topology is induced by a 
natural metric, called the prefix metric which is defined as follows. For a ^ a' £ E N , the 
distance d is given by 

d(a, a 1 ) = 2~ r where r = min{n | a(n) ^ a' (n)}. 


When E is finite this topology is the classical Cantor topology. When E is countably 
infinite the topological space is homeomorphic to the Baire space N N . 

Let E and T be two alphabets. The function which maps each pair (a, 7 ) £ E N x T N to 
the element (cr(0), 7 ( 0 )), (cr(l), 7 ( 1 )),... of (E x T) N is a homeomorphism between E N x T N 
and (E x T) N allowing us to identify these two spaces. 

If E is a set, a £ E N and l £ N, then a\l is the prefix of a of length l. 

We set 2 := {0,1} and := {a £ 2 N | Vfc £ N 3 i > k a(i) = 1}. This latter set is 
simply the set of infinite words over the alphabet 2 := { 0 , 1 } having infinitely many symbols 1 . 

We will work in the spaces of the form E N , where E is a finite set with at least two 
elements. We will fix a topology te on S N , and a basis Be for te- We consider the following 
properties of the family (te,Be)e 5 using the previous identification: 

(PI) B S contains the usual basic clopen sets N w , 

(P2) Be is closed under finite unions and intersections, 

(P3) Be is closed under projections, in the sense that if T is a finite set with at least two 
elements and L £ Bext, then 7r 0 [L] £ Be, 

(P4) for each L £ Be there is a closed subset C of E N x (i.e. C is the intersection of a 
closed subset of the Cantor space E H x 2 N with E N x Poo) which is in Bex 2 , and such 
that L = 7 r 0 [C]. 

► Theorem 10. Assume that the family (te,Be)e satisfies the properties (P1)-(P4)- Then 
the topologies te are strong Choquet. 

Proof. We first describe a strategy r for Player 2. Player 1 first plays ao £ E N and a Ts-open 
neighborhood Uq of ao- Let L 0 in Be with oy £ L 0 C Uq. Property (P4) gives Co with 
Lq = 7 r 0 [Co]. This gives ao £ Poo such that (oo,ao) £ Co- We choose 1° £ N big enough to 
ensure that if := ao|?o, then has at least a coordinate equal to 1. We set wq := cr 0 11 
and Vo : = ^oIC) D {N Wo x iV s o)]. By properties (P1)-(P3), Vo is in Be and thus Ts-open. 
Moreover, ao £ Vo Q L 0 C Co, so that Player 2 respects the rules of the game if he plays Vo- 
Now Player 1 plays a± £ Vo and a te- open neighborhood U\ of ay contained in Vo- 
Let L\ in Be with ay £ L\ C U\. Property (P4) gives Ci with L\ = 7 r 0 [Ci]. This 
gives ay £ P^ such that ( 0 y,ay) £ Cy. We choose l\ £ N big enough to ensure that if 
s\ j := ay|Zg, then sj has at least one coordinate equal to 1 . As ay £ Vg, there is a' 0 £ Poo 
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such that (< 7 i, 0 !q) £ Cq fl (N Wo x N s o). We choose > Z° big enough to ensure that 
if Si := at' 0 \li, then s° has at least two coordinates equal to 1. We set w\ := a±\2 and 
V\ := 7 To[Co D (N Wl x N s o)] n 7 r 0 [Ci D (N Wo x 7V s i)]. Here again, V\ is rs-open. Moreover, 
<Ti £ V\ C [/i and Player 2 can play Vi. 

Next, Player 1 plays a 2 £ V\ and a r^-open neighborhood U 2 of <j 2 contained in Vj . 

Let L 2 in Bv with a 2 £ L 2 C U 2 . Property (P4) gives C 2 with L 2 = ’Kq[C 2 \. This 

gives a 2 £ such that (cr 2 ,a 2 ) £ C 2 . We choose 1% £ N big enough to ensure that if 
Sq := a 2 \lo, then Sq has at least one coordinate equal to 1 . As cr 2 £ Vi, there is a.\ £ P^ 
such that (a- 2 ■ a \) £ C\ fl (N Wo x N s i). We choose l\ > Iq big enough to ensure that if 
s) := ot'i\l\, then s} has at least two coordinates equal to 1. As a 2 £ Vi, there is Qq £ P^ 
such that (<t 2 , Oq) £ C 0 fl (N Wl x N s o). We choose big enough to ensure that if 

s 2 := a ol^ 2 i then s° has at least three coordinates equal to 1. We set w 2 := <t 2 \3 and 
V 2 := 7 r 0 [Co n (N W2 x N s o)] n 7 r 0 [Ci 0 (N Wl x N s i)] n 7 T 0 [C 2 0 (N Wo x N s 2 )]. Here again, V 2 
is rs-open. Moreover, a 2 £ V 2 C U 2 and Player 2 can play V 2 . 

If we go on like this, we build wi £ S !+1 and sj 1 £ 2* such that iu 0 C w\ C ... 
and Sg ^ s" ^ ... This allows us to define a := lim/^oo wi £ £ N and, for each n £ N, 

a n := limj^oo sf £ 2 N . Note that a n £ P^ since sf has at least l + 1 coordinates equal to 

1. As (a,a n ) is the limit of ('U)qsJ 1 ) as l goes to infinity and N WI x meets C n (which is 
closed in S N x Poo), (a,a n ) £ C n . Thus 

=n Ln -n v - 

nSN neN n£N neN 

so that t is winning for Player 2. ◄ 

3.1 The Gandy-Harrington topology 

We have already mentioned that the Gandy-Harrington topology is not Polish in general. 
However, it is almost Polish as it fulfills the four properties (P1)-(P4). 

Let E be a finite alphabet with at least two elements and let X be the space E N equipped 
with the topology ty, := Ex generated by the family Bs of El subsets of X. Note that the 
assumption of Theorem 10 are satisfied. Indeed, (P1)-(P3) come from 3E.2 in [24]. For (P4), 
let F be a 77° subset of X x N n such that L = ttq [F]. Let ip be the function from N n to 2 N 
defined by <p(/3) = .... Note that ip is a homeomorphism from N n onto P^, and 

recursive (which means that the relation tp(0) £ A(2 N ,n) is semirecursive in (3 and n). This 
implies that C := (Id xip)[F] is suitable (see 3E.2 in [24]). 

Note that 7 v is second countable since there are only countably many El subsets of X 
(see 3F.6 in [24]), Tf since it is finer than the usual topology by the property (PI), and strong 
Choquet by Theorem 10. 

One can show that there is a dense basic open subset Ox of (X, ts) such that S fl O.y 
is a clopen subset of (0 .y,ts) for each E^ subset S' of X (see [19]). In particular, (£Ix,te) 
is zero-dimensional, and regular. As it is, just like {X, r s ), second countable, T) and and 
strong Choquet, (Ox,r s ) is a Polish space, by Theorem 9. 

3.2 The Biichi topology 

Let E be a finite alphabet with at least two symbols, and X be the space E N equipped with 
the Biichi topology tb generated by the family Bb of w-regular languages in X. Theorem 29 
in [26] shows that tb is metrizable. We now give a distance which is compatible with tb- 
This metric was used in [14] (Theorem 2 and Lemma 21 and several corollaries following 
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Lemma 21). A similar argument for subword metrics is in [[15], Section 4]. If A is a Biichi 
automaton, then we denote |A| the number of states of A. We say that a Biichi automaton 
separates x and y iff 


(x G L(A) A y^L(A)) V (y G L(A ) A x^L(A)). 


The distance S on £ N is then defined as follows: for x,y G £ N , S(x,y) = 


0 if x = y, 

2~ n iix^y, 

where n := min{|A| | A is a Biichi automaton which separates x and y}. We now describe 
some properties of the map <5. This is the occasion to illustrate the notion of complete metric. 


► Proposition 11. 

1 . the map S defines a distance on £ N , 

2. the distance S is compatible with tb, 

3. the distance S is not complete. 

Proof. 1. If x, y G £ N , then 5(x, y) = S(y, x ), by definition of S. Let x, y, z G £ N , and assume 
that 5(x,y) +5(y,z) < S(x,z) = 2~ n . Then S(x,y) < 2~ n and S(y,z) < 2~ n hold. In 
particular, if A is a Biichi automaton with n states then it does not separate x and y and 
similarly it does not separate y and 2 . Thus either x,y,z G L{A) or x,y,z ^ L{A). This 
implies that the Biichi automaton A does not separate x and z. But this holds for every 
Biichi automaton with n states and then S(x, z) < 2~ n . This leads to a contradiction and 
thus S(x, z ) < S(x, y) + S(y, z) for all x,y,z G £ N . This shows that A is a distance on £ N . 

2. Recall that an open set for this topology is a union of w-languages accepted by Biichi 
automata. Let then L(A) be an w-language accepted by a Biichi automaton A having n 
states, and x G L(A). We now show that the open ball B(x, 2 U n + 1 )) with center x and 
(5-radius 2~( n+1 ' ) is a subset of L(A). Indeed, if S(x, y) < 2 “(” +1 ) < 2~ n , then x and y cannot 
be separated by any Biichi automaton with n states, and thus y G L{A). This shows that 
L{A) (and therefore any open set for tb ) is open for the topology induced by the distance 
S. Conversely, let B(x, r ) be an open ball for the distance 5, where r > 0 is a positive 
real. It is clear from the definition of the distance S that we may only consider the case 
r = 2~ n for some natural number n. Then y G B{x , 2 ~ n ) if and only if x and y cannot be 
separated by any Biichi automaton with p < n states. Therefore the open ball B(x, 2~ n ) is 
the intersection of the regular w-languages L(Ai) for Biichi automata Ai having p < n states 
and such that x G L(Ai) and of the regular w-languages T, n \L(Bi) for Biichi automata £>, 
having p < n states and such that x£L{Bi). The class of regular w-languages being closed 
under complementation and finite intersection, the open ball B(x,2~ n ) is actually a regular 
w-language and thus an open set for tb- 

3. Without loss of generality, we set £ = {0,1} and we consider, for a natural number 
n > 1, the w-word X n = 0 n! • 1 • 0“ over the alphabet {0,1} having only one symbol 1 
after n! symbols 0, where n\ := n X (n — 1) X • • • x 2 x 1. Let now m > n > k and A be a 
Biichi automaton with k states. Using a classical pumping argument, we can see that the 
automaton A cannot separate X n and X m . Indeed, assume first that X n G L(A). Then, 
when reading the first k symbols 0 of X n , the automaton enters at least twice in a same state 
q. This implies that there is a sequence of symbols 0 of length p < k which can be added 
several times to the word X n so that the resulting word will still be accepted by A. Formally, 
any word 0 n '+ l P ■ 1 • CU, for a natural number l > 1, will be accepted by A. In particular 
to! = n! x (n + 1) x • • • x m = n\ + n\ x ^((n + 1) x • • • x to) — 1^ is of this form and thus 
X m G L(A). A very similar pumping argument shows that if X m G L(A), then X n G L(A). 
This shows that 5(X n ,X m ) < 2~ k and finally that the sequence (X n ) is a Cauchy sequence 
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for the distance S. On the other hand if this sequence was converging to an w-word x then x 
should be the word 0“ because tb is finer than tq. But 0“ is an ultimately periodic word 
and thus it is an isolated point for tb- This would lead to a contradiction, and thus the 
distance <5 is not complete because the sequence (X n ) is a Cauchy sequence which is not 
convergent. ◄ 


Proposition 11 gives a motivation for deriving Theorem 1 from Theorem 10. Note that 
the assumption of Theorem 10 are satisfied. Indeed, (P1)-(P3) come from Theorem 3. We 
now check (P4). 

► Lemma 12. Let £ be a finite set with at least two elements, and L C £ N be an uj-regular 
language. Then there is a closed subset C of £ N x P^, which is Lo-regular as a subset of 
(£ x 2 ) n identified with £ N x 2 N , and such that L = ttq[C\. 

Proof. Let A = (H,Q,S,Qi,Qf) be a Biichi automaton and let L = L{A) be its set of 
accepted words. Let Xf be the characteristic function of the final states. It maps each 
state q to 1 if q £ Qf and to 0 otherwise. The function Xf is extended to by setting 
a = Xf((Qn)ne n) where a{n) = Xfi.Pn)- Note that a run p of A is accepting if and only if 
Xf(p) e Poo- 

Let C be the subset of £ N x Poo defined by 

C := {(cr, a) £ £ N x P^ | 3 p run of A on cr s. t. a = Xf(p)}- 

By definition of C, L = no [C] . Let K be the subset of £ N x 2 N x Q V: be defined by 

K ■= {(cr,a,p) e £ n x 2 h x Q n \ p is a run of A on a s. t. a = Xf(p)}- 

Since K is compact as a closed subset of a compact space and C = 7r s r,- x2 m [K] n (E N x Poo), 
the subset C is a closed subset of £ N x Poo- It remains to show that C is indeed w-regular. 
Let A be defined by 

A := {(p, ( a,e),q) £ Q x (E x 2) x Q \ (p, a, q) € <5 A (e = 1 <!=> p G Qf)}. 


This allows us to define a Biichi automaton by A' := (E x 2, Q , A, Qi , Qf). Note that 

(cr,a) G L(A') 3(si) i6N G Q n (s 0 £ Qi A Vi G N (s», (cr(i),a(i)),s i+ i) G A) A 

Vfc gN 3 i> k Si £ Qf 
tioG Pqo A 3(sj)jgpj G (so G Qi A Vi G N (sj, <r(i), Sj+i) G S A 

(a(i) = 1 Si £ Qf)) 


(cr, a) £ C. 


Thus C = L(A') is w-regular. 


◄ 


► Corollary 13. Let E be a finite set with at least two elements. Then the Biichi topology tb 
is zero-dimensional and Polish. 

Proof. As there are only countably many possible automata (up to identifications), Bs 
is countable. This shows that tb is second countable. It is Tf since it is finer than the 
usual topology by the property (PI), and strong Choquet by Theorem 10. Moreover, it is 
zero-dimensional since the class of w-regular languages is closed under taking complements 
(see Theorem 3). It remains to apply Theorem 9. ◄ 
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3.3 The other topologies 

Proof of the main result. It is well known that (£ n ,tc) is metrizable and compact, and 
thus Polish. 

By Theorem 3.4 in [23], the implication (iii) =>■ (i), A}(£ H ) is a basis for a zero-dimensional 
Polish topology on £ N . Recall that a Biichi Turing machine is unambiguous if every 
w-word <7 £ £ N has at most one accepting run. By Theorem 3.6 in [8], a subset of £ N 
is A\ if and only if it is accepted by an unambiguous Biichi Turing machine. Therefore 
Ba = Z\}(£ H ) is a basis for the zero-dimensional Polish topology Tg. 
h Corollary 13 gives the result for the Biichi topology. 

b Let (X,t) be a Polish space, and (C n ) ne N be a sequence of closed subsets of (X,t). By 
13.2 in [16], the topology t„ generated by rU{C„} is Polish. By 13.3 in [16], the topology 
Too generated by [J neN r n is Polish. Thus the topology generated by r U {C n \ n £ N}, 
which is T^, is Polish. This shows that the automatic topology and the alphabetic 
topology are Polish since they refine the usual product topology on £ N . 
h Note that S w a is a Gg subset of £ N , and thus a Polish subspace of £ N , by 3.11 in [16]. 
Note also that £ N = Ub/acs U U0 /ACs,i«e£*,6e£\A Swb.A, and that this union is 
disjoint. As all the sets in this union are open for the strong alphabetic topology, they 
are also clopen for this topology. This shows that this topology is the countable sum of 
its restrictions to the sets in this union. As the strong alphabetic topology coincides with 
the usual topology on each of these sets, it is Polish, by 3.3 in [16]. ◄ 

4 Consequences for our topologies 

4.1 Consequences not directly related to the polishness, concerning 
isolated points 

Notation. If z e {C,S,B,A,a,s}, then the space (£ n ,t 2 ) is denoted S z . The set of 
ultimately periodic w-words on £ is denoted Ult := {n • | u, v £ £*\{0}}, and P := 

£ N \Ult. 

1. As noted in [26], Ult is the set of isolated points of Sb and Sa (recall that a point a £ £ N 
is isolated if {cr} is an open set). Indeed, each singleton {u-v^} formed by an ultimately 
periodic w-word is an w-regular language, and thus each ultimately periodic cc-word is 
an isolated point of 5 a- Conversely, if {cr} is T^-open, then it is w-regular and then the 
w-word cr is ultimately periodic (because any countable w-regular language contains only 
ultimately periodic w-words, see [1, 25, 30]). 

2. Every nonempty w-regular set contains an ultimately periodic w-word, [1, 25, 30]. In 
particular, the set Ult of isolated points of Sb and Sa is dense , and a subset of Sb or 
5 a is dense if and only if it contains Ult. 

3. Let A be a topological space in which the set I of isolated points is dense, for example 
Sb by (2). 

(a) A subset of X is nowhere dense (i.e., its closure has empty interior) if and only if it 
is meager (i.e., it is a countable union of nowhere dense sets), indeed, a meager set 
does not meet I. 

(b) Recall that a subset L of a topological space Y has the Baire property if there is an 
open subset O of Y such that the symmetric difference LAO := ( L\0 ) U ( 0\L ) is 
meager. This is equivalent to say that L = G U M, where G is Gg (i.e. a countable 
intersection of open sets) and M is meager (see 8.23 in [16]). Every subset of A has 
the Baire property (which is very uncommon, see for exemple 8.24 in [16]). Indeed, 
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we can even say that every subset of X can be written L = O U N , where O is open 
and N is nowhere dense (O is L D I, and N is L\I). 

(c) Recall that a topological space is a Baire space if the intersection of countably many 
dense open sets is dense. By the Baire category theorem, every completely metrizable 
space is Baire. The space X is Baire in a very strong way: the intersection of any 
family of dense sets is dense, since a subset of X is dense if and only if it contains I. 
This is very unusual, since for example we can find two disjoint countable dense sets 
in R. 

(d) A consequence of (b), (c) and 8.38 in [16] is that any map from X into a second 
countable Polish space is continuous on a dense Gg subset of A'. Note that if X is 
Sb or Sa, then any map from A' into a topological space is continuous on the dense 
open set Ult of ultimately periodic u>- words. 

(e) There is a strong version of the Kuratowski-Ulam theorem (see 8.41 in [16]): assume 
that A' and Y are topological spaces in which the set of isolated points is dense. 
Then this property also holds in the product A' x Y, so that a subset of X x Y 
is meager if and only if it contains no isolated point, which is also equivalent to 
the fact that for each isolated point x £ X (resp., y £ Y), the vertical (resp., 
horizontal) section at x (resp., y) contains no isolated point. An interesting example 
is the case of an infinitary rational relation R(A) C £ N x T N accepted by a 2- 
tape Biichi automaton A which may be synchronous or asynchronous. Indeed if 
£ N x r N is equipped with the product topology of the Biichi topologies on £ N and 
T N then a non-empty rational relation is always non-meager. This follows easily 
from the fact that Dom (R(A)) = {x £ £ N | 3y £ T N (x,y) £ -R(A)} is a regular 
w-language. Thus Dom(l?(A)) contains an ultimately periodic word x and then 
{y £ T N | (x,y) £ R(A)} is also a non-empty regular w-language and so it contains 
also an ultimately periodic word y. 

4.2 Consequences of the polishness 

4.2.1 Consequences related to Cantor-Bendixson Theorem 

We consider our topologies on £ N , where £ is a finite set with at least two elements. We 
give a (non exhaustive) list of results. We often refer to [16] when classical descriptive set 
theory is involved. The reader may read this book to see many other results. 

4. (a) The union P U Ult is the Cantor-Bendixson decomposition of Sb and Sa (see 6.4 
in [16]). This means that P is perfect (i.e., closed without isolated points) and Ult 
is countable open. Let us check that P is perfect. We argue by contradiction, so 
that we can find a £ P and an w-regular language L such that {ex} = L\Ult. Note 
that L C {a} U Ult is countable. But a countable regular w-language contains only 
ultimately periodic words (see [25]), and thus L C Ult, which is absurd. 

(b) The closed subspace (P,tb) of Sb is homeomorphic to the Baire space N N . Indeed, it 
is not empty since Ult is countable and £ N is not, zero-dimensional and Polish as a 
closed subspace of the zero-dimensional Polish space Sb- By 7.7 in [16], it is enough 
to prove that every compact subset of (P,tb) has empty interior. We argue by 
contradiction, which gives a compact set K. Note that there is an w-regular language 
L such that P (~l L is a nonempty compact subset of K , so that we may assume that 
K = P D L. Theorem 2 gives ( Ui)i <n and (V))j<„ with L = \J i<n ^ ' V?■ On the 
other hand, L is not countable since every countable regular w-language contains 
only ultimately periodic words and K = P n L is non-empty. Thus n > 0 and, for 
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example, UqVq is not countable. 

This implies that we can find vq,vi G Vo which are not powers of the same word. 

Indeed, we argue by contradiction. Let vq G Vq \ {0}, and v G £* of minimal 

length such that vq is a power of v. We can find wq,w\,... G Vo\{ 0} such that 
a := wqWi... v u . Fix a natural number i. Then wt and vq are powers of the same 
word w. By Corollary 6.2.5 in [22], v and w are powers of the same word u, and vq 

too. By minimality, u = v, and Wi is a power of v. Thus <r = v u , which is absurd. 

Let uq G Uq, and V := {uo}-{uo, V\} u . Note that P D L' is a rs-closed subset of K, 
so that it is rs-compact. As the identity map from (P ft L' , tb) onto (P fl L', tc), 
(where tc is the Cantor topology), is continuous, P fl L' is also rc-compact. But 
the map at—> uo'i’a(o)' u a(i) ■ • • is a homeomorphism from the Cantor space onto L', by 
Corollaries 6.2.5 and 6.2.6 in [22]. Thus P fl L' = Z/\Ult is a dense closed subset of 
L'. Thus PflI' = L, which is absurd since uq ■ Vq G L'\P. 

(c) By (b) and 7.9 in [16], for each Polish space Y, we can find a rs-closed set f CP 
(an intersection of w-regular sets) and a continuous bijection from P onto Y. In 
particular, if Y is not empty, then there is a continuous surjection from P onto Y 
extending the previous bijection. By 7.15 in [16], if moreover Y is perfect, then there 
is a continuous bijection from P onto Y. 

(d) A consequence of (b), Corollary 13, and 7.10 in [16], is that the space Sb is not K a 
(i.e., countable union of compact sets). 

4.2.2 Consequences related to Borel sets 

5. (a) The a-algebra A z generated by B 2 is exactly the u-algebra of Borel subsets of Sc 
(generated by the open subsets of Sc)- Indeed, the identity map from S z onto Sc 
is a continuous bijection. By 15.2 in [16], the inverse map is Borel, so that the two 
Borel structures coincide. It remains to note that the a -algebra generated by B 2 is 
exactly the u-algebra of Borel subsets of S z . 

(b) We can define a natural hierarchy in A z : let SJ := (Bj) ff be the set of countable 
unions of elements of B 2 , and, inductively on 1 < £ < wi, II| := {->P | L G S|} and 
E| := ([I <£ njpo-. This hierarchy is actually the Borel hierarchy in the family of 
Borel subsets of S z , in the sense that S| = £°(<S 2 ) and II| = n°(5 2 ) (see 11.B in 
[16]; we will also consider A| := S|nII| = A°(5 2 ) and IIq(iS 2 ) := B 2 ). By the main 
result and 22.4 in [16], this hierarchy is strict and of length ui i (the first uncountable 
ordinal). This comes from the existence of universal sets for these classes (see 22.3 in 
[16]). 

Recall that a subset U of 2 N x £ N is universal for a class T of subsets of E N if it is in T 
and the T subsets of E N coincide with the vertical sections U x = {y G £ N | (x, y) G U } 
of U. 

Note that a set is Borel for tc if and only if it is Borel for any topology r 2 , but 
the levels of the Borel hierarchy may differ for the two topologies. For instance any 
singleton associated with an ultimately constant word is not open for the Cantor 
topology and is actually open for the S , Biichi, automatic, alphabetic and strong 
alphabetic topologies. Therefore the existence of universal sets for each level of the 
Borel hierarchy of S z is different from the existence of universal sets for each level 
of the Borel hierarchy of Sc if z ^ C, and could lead to other consequences or new 
applications to other domains of theoretical computer science. 

(c) The Wadge theorem holds (see 22.10 in [16]): let 2 G {S,B}, ( > 1 be a countable 
ordinal. A subset L of £ N is in £|\II| if and only if it is in £|, and for each 
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zero-dimensional Polish space X and any E° subset B of X there is / : X —► S z 
continuous with B = / -1 (L) (we can exchange £| and II|). 

(d) The Saint Raymond theorem holds (see 35.45 in [16]): let £,r be finite sets with 
at least two elements, L C S N x T N be in A z with E| vertical sections. Then 
L = UneN Lm where L n is in A z and has Ilf vertical sections. 

(e) By 4.(b) and 13.7 in [16], for each L in Ab , we can find a Ts-closed set F C P 
(an intersection of w-regular sets) and a continuous bijection from F onto L. In 
particular, if L is not empty, then there is a continuous surjection from P onto L 
extending the previous bijection. 

4.2.3 Consequences related to analytic sets 

6 . Recall that the analytic sets are the projections of the elements of A z . We saw in (5).(a) 
that t z and tc have the same Borel sets. This implies that r~ and tq have the same 
analytic sets. Then the following items (a)-(d) are not new, but we cite them as interesting 
results about the topology t z . 

(a) The class of analytic sets contains strictly A z because of the existence of universal 
sets (see 14.2 in [16]). 

(b) The Lusin separation theorem holds (see 14.7 in [16]): assume that L,M C E N are 
disjoint analytic sets. Then there is S in A z such that L C S C -i M (we say that S 
separates L from M). 

(c) The Souslin theorem holds (see 14.11 in [16]): the elements of A z are exactly the 
analytic sets whose complement is also analytic. 

(d) By 14.13 in [16], an analytic set L has the perfect set property : either L is countable, 
or L contains a copy of Sc (this copy has size continuum and is compact). 

(e) The parametrization theorem for Borel sets holds (see 35.5 in [16]). We say that a 
set is co-analytic if its complement is analytic. We can find a co-analytic subset T> of 
2 n , an analytic subset S of 2 N x E N , and a co-analytic subset V of 2 N x E N such that 
the vertical sections S a and V a of S and V at any point a oiV are equal, and the 
elements of A z are exactly the sets S a for some a in V. 

4.2.4 Consequences related to uniformization problems 

7. Let X , Y be sets, L C X x Y , and f:X—>Y be a partial map. We say that f uniformizes 
L if the domain of / is ttq \L\ and the graph of / is contained in L. In the sequel, E and T 
will be finite sets with at least two elements. 

(a) Recall that if L C E N x T N is an infinitary rational relation, then L can be uniformized 
by a function whose graph is w-rational (see Theorem 5 in [2]). Moreover, the inverse 
image of an w-regular language by such a function is itself w-regular, and thus the 
function is continuous for tb- 

(b) The Arsenin-Kunugui theorem holds (see 18.18 in [16]): if L C E N x T N is in A z and 
has K a vertical sections, then L can be uniformized by a function whose graph is in 
A~ (a K a set is a countable union of compact sets). 

(c) Let ( > 1 be a countable ordinal. The class E^ has the number uniformization 
property : if A' is a zero-dimensional Polish space and L C X x N is E°, then L can 
be uniformized by a function whose graph is E°. Consequently, for z £ {<5, B}, 

_ £| has the generalized reduction property : if (L„) ng N is a sequence of E| subsets 
of E N , then there is a sequence (B n ) ne ^ of pairwise disjoint £| subsets of £ N with 
UneN Bn = UneN ^ n and B n C L n , 
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_ n| has the generalized separation property : if ( L n ) ne ^ is a sequence of II| subsets 
of S N with empty intersection, then there is a sequence ( B n ) n of A| subsets of 
S N with empty intersection and B n DL n (see 22.16 in [16]). 

Moreover, 

_ the class of co-analytic sets has the number uniformization property and the 
generalized reduction property, 

- the class of analytic sets has the generalized separation property (see 35.1 in [16]). 

(d) The Novikov-Kondo theorem holds (see 36.14 in [16]): if L C S N x T N is co-analytic, 
then L can be uniformized by a function whose graph is co-analytic. 

5 Concluding Remarks 

We have obtained in this paper new links and interactions between descriptive set theory 
and theoretical computer science, showing that four topologies considered in [26] are Polish, 
and providing many consequences of these results. 

Notice that this paper is also motivated by the fact that the Gandy-Harrington topology, 
generated by the effective analytic subsets of a recursively presented Polish space, is an 
extremely powerful tool in descriptive set theory. In particular, this topology is used to prove 
some results of classical type (without reference to effective descriptive set theory in their 
statement). Among these results, let us mention the dichotomy theorems in [13, 17, 19, 20]. 
Sometimes, no other proof is known. Part of the power of this technique comes from the nice 
closure properties of the class 27* of effective analytic sets (in particular the closure under 
projections). 

The class of w-regular languages has even stronger closure properties. So our hope is that 
the study of the Biichi topology, generated by the w-regular languages, will help to prove 
some automatic versions of known descriptive results in the context of theoretical computer 
science. 

From the main result, we know that there is a complete distance which is compatible 
with t z . It would be interesting to have a natural complete distance compatible with Tg. We 
leave this as an open question for further study. 
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